Statistical Properties of Demand Fluctuation in the Financial Market 
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We examine the out-of-equilibrium phase reported by Plerou et. al. in Nature, 421, 130 (2003) 
using the data of the New York stock market (NYSE) between the years 2001 -2002. We find that 
the observed two phase phenomenon is an artifact of the definition of the control parameter coupled 
with the nature of the probability distribution function of the share volume. We reproduce the two 
phase behavior by a simple simulation demonstrating the absence of any collective phenomenon. 
We further report some interesting statistical regularities of the demand fluctuation of the financial 
market. 
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Recently a report based on New York stock exchange 
(NYSE) data between period 1995-1996 ref. Q reported 
that the financial market has two phases, namely, the 
"equilibrium" phase, and the "out-of-equilibrium" phase. 
Ref. [l| further reported a critical point which is the 
boundary between a and finite value of the order pa- 
rameter for the observed phase transition. 

In this paper we address the following questions in 
an effort to understand the observed two phase phe- 
nomenon: 

1. What is the cause of the observed two phase be- 
havior ? 



2. Do large changes 
equilibrium phase ? 



of price occur 



out-of- 



Our study is based on the NYSE Trades and Quotes 
(TAQ) database for the period 2001-2002 which records 
every ask, bid, and transaction price. 

Figurengives an estimate of the percentage of transac- 
tions during the period period 2001-2002 that was either 
at the ask price (scaled to be 1), at the bid price (scaled 
to be -1), or at an intermediate value between the ask 
and the bid prices. 

Using the method of ref. we first identify a buyer 
initiated or a seller initiated transaction by a quantity a 
which is defined as follows: 




p > (ask - bid) 12 
p = (ask — bid)/ 2 
p < (ask - bid)/2 



(1) 



where p is a transaction price. Usually a transaction is 
executed when a new quote hits the lowest ask price or 
the highest bid price. 

As in ref. 0, we define the volume imbalance Q, and 
its standard deviation as 



n(t) 



N(t) 



(2) 
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FIG. 1: Percentage of transactions occurring at ask, bid 
and/or an intermediate price between ask and bid prices 
which are normalized to +1 and -1, respectively. We observe 
that nearly 1/3 of the transactions take place in each one of 
these three groups. 



E(t) =< \qitti- < qiai > t | > t 



(3) 



where qi is the share volume of the ith transaction within 
the period At. 

Fig. |3 displays the probability density function (PDF) 
P(n|X) of £1 for a given S. We observe a bi-modal 
PDF with two extrema when S is larger than a critical 
value. This feature of P(0|S) is reported as an instance 
of two phase behavior in the financial market [Fig. 1 
(a)of ref. Q]. 

We next accumulate the a between intervals t and t + 

N(t) 

At to estimate a quantity $ = "^2 a>i which measures 

i=l 

the number imbalance between buyer and seller initiated 
trade within the time interval At. 

The first moment a of $ is defined as 



a(t) =< a,- < a, > t \ > t 



(4) 
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FIG. 2: Probability density function P(Q|E) for given E. We 
observe that P(fi|E) for E > 0. Reference Q interprets the 
two maximas of P(fi|E) as the two phases of the financial 
market. 



TABLE II: 
tion. 



Different combinations of a, q and N for Simula- 



a 


random 1,-1 


Normal(0,l) 


q 


power law q~ L ' b 


Normal(10,3) 


N 


power law A~ a ' 4 


Normal(10,3) 



TABLE III: Correlations of pair (q, E) and (N, E). 



Correlation 




(A,E) 


TAQ 2000-2001 database 


0.69 


0.01 


Simulation with q, N 
distributed as in Fig 3a 


0.95 


0.00 


Simulation with q, N 
distributed as in Fig 3b 


0.08 


0.03 


Simulation with q, N 
distributed as in Fig 3c 


0.71 


0.00 


Simulation with q, N 
distributed as in Fig 3d 


0.19 


0.11 



TABLE I: Correlations 





\R\ 


TV 


q 


E 


a 


\R\ 


1.00 


0.10 


0.20 


0.11 


0.11 


N 




1.00 


0.03 


0.01 


0.50 


q 






1.00 


0.69 


0.14 


E 








1.00 


0.08 


a 










1.00 



where < ... > t represents an average in the time interval 
At. 

We also define 

«W = W) g * ( 5 ) 

which measures the average volume of trade per transac- 
tions between t and t + At. 

We estimate Q(t), E(i), and q(t) averaged over 102 
stocks which has the largest total volume among all 
NYSE stocks in 2002 with At = 15 minutes. 0. 

In phase transitions observed in physical systems, the 
control parameters are independent variables. But in 
ref. , the control parameter E is the magnitude of fluc- 
tuation of the order parameter Q. When fluctuations 
are large, the underlying PDF of the random variable is 
wide, leading us to think that we might replicate P(0|£) 
by simulation. 

First we evaluate the empirical correlations present in 
the database between different variables defined in eq. 1- 
5. Table H] tabulates the estimated correlations. 

Next, for simulation, we consider a, q, and N to be 
identically and independently distributed (i.i.d) from two 
possible PDFs given in Table ITT1 . We execute the sim- 
ulation for eight (2 3 ) possible combinations of the set a, 
q and N. 



Since a, q, and are i.i.d, there are no autocorrela- 
tions like that in the TAQ time series, but even then, 
in we find P(f2|E) to have a bi-modal distribution only 
when the PDF of the share volume is a power law (c.f. 
Fig 3). This observation can be explained as follows: 
First note that the PDF P{q) of q has the functional 
form of a power law, i.e., P(q) ~ q~ 15 . Next note that 
the has a q in its definition, thus the PDF P(f2) of 
also has the functional form of a power law. Occurrence 
of extreme positive and negative events are much more 
probable when P(fi) has a fat tail. Since S by definition 
is the standard deviation of the random variable Q, large 
values of £ occur when extreme events (both positive 
and negative) of are sampled. Thus P(f2|S) has two 
extrema resulting from large positive and negative sam- 
pling of the random variable f2 when we choose groups 
with large E values [c.f. Fig. 02a) and Etc)]. 

Table Hill tabulates the estimated correlations between 
the pairs q, E, and N, E for the case of TAQ database 
and simulation. We find correlation values for the TAQ 
database and simulation to be statistically similar when 
q and N are power law distributed, which further demon- 
strates that the observed two phase effect is not a sig- 
nature of hidden collective phenomena within financial 
market. An explanation similar to the one above is re- 
ported in ref. 3J. 

To examine this in closest detail, we next estimate the 
PDF of q and N for a given E from the TAQ database. 
FigEIa) show, that q is large for large E. FigQJb) shows 
that N is almost independent of E. By this, we can infer 
that the bi-modal PDF P(f2|E) is caused by the trans- 
actions of large investors with large number of shares. 
The small decrease in < N > as E increases has a simple 
explanation, which was pointed out by our referee. In 
normal markets the specialist, whose role is to maintain 
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FIG. 3: P(ft|E) with fi and E generated from i.i.d., for 
four different combinations of a, q, N. We observe a bi- 
modal distribution of P(fi|E) only when P(q) has a power 
law functional form. Since P(q) ~ ? -1 ' 5 and the definition of 
fi has q in its argument, P(fi) also has the functional form 
of a power law. Thus for the power law distributed P(fi) 
occurrence of extreme positive and negative events are much 
more probable than if P(fi) did not have a fat tail. E by 
definition is the standard deviation of the random variable fi. 
Large values of E occurs when extreme events (both positive 
and negative) are sampled. Thus P(fi|£) has two extremas 
resulting from a large positive and negative sampling of the 
random variable fi when groups with large E values are chosen 
[c.f. (a) and (c)]. 
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FIG. 4: (a) Probability density function P(g|£) of the volume 
of shares for given E. Note that q, N, and E are normalized 
quantities with their average value subtracted. We observe 
that the transaction volumes are higher for groups with large 
E. (b) PDF P(iV|£) of N for given E. We observe that 
the mean number of transaction < N > slightly decreases for 
groups with high E. 



a fair and orderly market, crosses the book via his clerk 
very regularly in an almost automated fashion. When 
fluctuations become large the specialist must take a close 
look at the electronic order book and at the order from 
the floor before he may decide which and at what price 
it is fair to cross the orders. This manual intervention 
takes time and is a plausible cause of the decrease in the 
number of trades. 

To have a better understanding of how the 
absolute value of the price fluctuations \R\ ef- 
fects E, a, q, N, we next study the PDF 
P(E||-R|), P(<t\\R\), P(q\\R\), P(N\\R\) for given \R\ us- 
ing the TAQ database. First we sort the database of 
102 x 26 x 400 items ( 102 stocks, 26 terms per a day, 
about 400 days in two years) with respect to \R\. Next 
we divided these items into four groups bordered by 99.9 
percentile, 99 percentile and 90 percentile. Figure[S]plots 
thePDFP(E||i?|), P(cr\\R\), P(q\\R\), P{N\\R\) of each 



4 



of these groups. 

We observe that even though \R\ is not correlated to 
£ or q [c.f table 0, the PDF P(T,\\R\), P(q\\R\) exhibits 
significant differences among different groups of \R\. This 
is because rarely occurring large \R\ causes large fluctua- 
tions but does not significantly contribute to correlation. 

In contrast other quantities such as a and N show there 
is not so much of a difference. FigOJd) again shows the 
same effect as seen in Fig0Jb) where large fluctuations 
decrease the number of transactions within a specified 
time interval. 

In conclusion it can be inferred that the "two-phase be- 
havior" as reported in 0] gives no evidence as to whether 
critical phenomena exist in the financial market or not. 
The observed out of equilibrium phase is a feature of 
the power law PDF of the share volume. The absolute 
value of price fluctuation causes a large fluctuation in the 
share volume, and large fluctuation of the absolute price 
causes a decrease in the number of trades in a specified 



time interval. 
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FIG. 5: (a) PDF P(E|[fl|) of £ for a given We ob- 

serve that rarely occurring high \R\ results in higher volume 
fluctuations, (b) PDF P{a\\R\) of a for a given \R\. The fluc- 
tuation of the number imbalance a is independent of \R\. (c) 
PDF P(q\\R\) of q for a given We observe that high |7?| 
prompts the average volume of shares per trade q to increase, 
(d) PDF P(N\\R\) of N for a given We observe a slight 
decrease in the number of transactions with an increase in 
\R\. 



